On the potential of glottal signatures for speaker recognition

نویسندگان

Thomas Drugman

Thierry Dutoit

چکیده

Most of current speaker recognition systems are based on features extracted from the magnitude spectrum of speech. However the excitation signal produced by the glottis is expected to convey complementary relevant information about the speaker identity. This paper explores the use of two proposed glottal signatures, derived from the residual signal, for speaker identification. Experiments using these signatures are performed on both TIMIT and YOHO databases. Promising results are shown to outperform other approaches based on glottal features. Besides it is highlighted that the signatures can be used for text-independent speaker recognition and that only several seconds of voiced speech are sufficient for estimating them reliably.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Advances in Glottal Analysis and its Applications

From artificial voices in GPS to automatic systems of dictation, from voice-based identity verification to voice pathology detection, speech processing applications are nowadays omnipresent in our daily life. By offering solutions to companies seeking for efficiency enhancement with simultaneous cost saving, the market of speech technology is forecast to be particularly promising in the next ye...

متن کامل

Glottal modeling and closed-phase analysis for speaker recognition

This paper concerns the application of glottal models and closed-phase analysis to the problem of speaker recognition. A glottal model based on one originally proposed by Fujisaki and Ljungqvist was used in conjunction with closed-phase analysis to yield features for a speaker recognition system used in the NIST 2003 Speaker Recognition Evaluation. Scores from the system based on the glottal mo...

متن کامل

Speaker Verification Using the Shape of the Glottal Excitation Function for Vowels

This paper seeks to establish a baseline for the potential contribution of the shape of the glottal source waveform to speaker recognition. A text-dependent speaker verification experiment was performed with 4 monosyllabic words spoken repeatedly by the 16 speakers of the TI46 speech data corpus. A single fundamental period was automatically extracted from each vowel centre and inverse-filtered...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Glottal Waveforms for Speaker Inference & A Regression Score Post-Processing Method Applicable to General Classification Problems

Contributions are made along two main lines. Firstly a method is proposed for using a regression model to learn relationships within the scores of a machine learning classifier, which can then be applied to future classifier output for the purpose of improving recognition accuracy. The method is termed r-norm and strong empirical results are obtained from its application to several text-indepen...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

On the potential of glottal signatures for speaker recognition

نویسندگان

چکیده

منابع مشابه

Advances in Glottal Analysis and its Applications

Glottal modeling and closed-phase analysis for speaker recognition

Speaker Verification Using the Shape of the Glottal Excitation Function for Vowels

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

Glottal Waveforms for Speaker Inference & A Regression Score Post-Processing Method Applicable to General Classification Problems

عنوان ژورنال:

اشتراک گذاری